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Figure 2 
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ST3 Gal III activities of the Amyiose purified 
refolded MBP-ST3Gal III fractions 





Figure 3 
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Figure 4 
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Figure 5 
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GnTl Cysl21Ser mutant 

avipiMacdretvnSldkllhy^ 
lgqv^^aavvveddlevapdffeyfotypfflc^ 
lepkwpkafwddwmnpeqrqgraci^^ 
apqlqvekvrtndrkelgeyirqytgrdsft^^ 



Gcggtgattcc«atcctggtcatcgcctgtgaccgcagcactgttcggcgctctctagaoaagctgctgcattatcg^ 

gctcttccccatcatcgttagccaggactgcgggcacgaggagacggccoaggccatcgcctcctacggcagcgcggtcacgcaoa 

tccggcagcccgacctgagcagcattgcggtgccgccggaccaccgcaagttccagggctactacaagatogcgcgccactaccg 

ctgggcgctggg«»ggtcttccggcagtttcgcttc^^ 

cgagtactttcgggccacotatccgctgctgaaggcogacccotecctgtggtgcgtctcggcctggaa^^ 
gatggtggacgccagcaggcctgagctgctctaccgcaccga^ 

gagctggagcccaagtggocaaaggccttetgggacgactggatgcggcggccggagcagcggcaggggcgggcctgcatacg 

ccctgagatctcaagaacgatgacctttggccgcaagggtgtgagccacgggcagttctttgaccagcacctcaagtttatca^^ 

accagcagtttgtgcacttcacccagctggacctg^^ 

gctccccagctgcaggtggagaaagtgaggaccaatgaccggaaggagctgggggaggtgogggtgcagtatacgggoaggga 

cagcttcaaggctttcgccaaggctctgggtgtcatggatgaccttaagtcgggggttccgagagctggctaccggggtattgtc^ 

ccagttcccgggccgccgtgtccacsctggcgcccccaccgaogtgggagggctatgatcctagctggaattag 



Figure 7 
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GnTl Cysl21Asp 



avipilviacdrstvrrdldWlhy^ 
lgqvfftjiHpaavweddlevapdfFey^ 
lepkv^kafwddwmrrpeqrqgracirpe 
apqlqvekvrtndrkelgevr^qytgrdsfkafakalgvm 

Gcggtgattcccatcctggtcatcgcct^^ 

agctettccccatcatogttagccaggactgcgggcacgaggagacggcccaggccatcgcctcctac^^ 

atccggcagcccgacctgagcagcattgcggtgwgccggaccaccgcaagttccagggctactacaagatcgcgc 

gctgggcgotgggccaggtcttccggcagtttcgcttccccgcggccgtggtggt^ 

tcgagtactttcgggccacctatccgctgctgaaggccgacccctccctgtggtgcgtctcggcctggaa^ 

agatggtggacgccagcaggcctgagctgctctaccgcaccgactttttccctggTO^ 

tgagctggagcccaagtggccaaaggccttctgggacgactggatgcggcggccggagca^ 

gccctgagatctcaagaacgatgacctttggccgcaagggtgtgagccacgggcagttctttga^ 

aaccagcagtttgtgcacttcacccagctggacct^ 

tgctccccagctgcaggtggagaaagtgaggaccaatgaccggaaggagctgggggaggtgcgggtgcagtatacgggcaggga 

cagcttcaaggctttcgccaaggctctgggtgtcatggatgaccttaagtcgggggttccgagagctggcte^ 

ccagttcccgggccgccgtgtccacctgg 



Figure 8 
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GnTl Cysl21Thr 



avipilviacdrstvrrtldklfoyipsael^^^ 
gqvfrq^aavvveddlevapdffeyfotypll^ 
epkwpkafwddwmnpeqrqgraciipd^ 
apqlqvekvrtndrkelgevrvqytgixlsfc^^ 



Gcggtgattcccatcctggteatcgcctgtgac^ 

agctct^ccatcatcgttagccaggactgcgggcacgaggagacggcccaggccatcgcctcctacggcagcgcggtcacgcac 
atccggcagcccgacctgagcagcattgcg#gw^ 

gctgggcgctgggccaggtottccggcagtttcgcttc<xcgcggccgtggtggtggaggatgacctggaggtggccccgg^ 

tcgagtactttcgggccacctatccgctgctgaaggccgacccctccctgtggtgcgtotcggcotggaatgac^ 

agatggtggacgccagcaggcctgagctgctetaccg^ 

tgagctggagcccaagtggccaaaggccttctgggacgactggatgcggcggccggagcagcggcaggggcgggcctgcatac 

gccctgagatctcaagaacgatgacctttggccgcaagggtgtgagccacgggcagttctttgaccagcacctcaagtttatcaagctg 

aaccagcagtttgtgcacttcacccagctggacctgtcttacctgcagcgggaggcctatgaccgagatttcctcgcccgcgtctacgg 

tgctccccagctgcaggtggagaaagtgaggaccaatgaccggaaggagctgggggaggtgcgggtgcagtatacgggcaggga 

cagcttcaaggctttcgccaaggctctgggfgtcatgga 

ccagttcccgggcogccgtgtccacctggcgcccccaccgacgtgggagggctatgatcctagctggaattag 



Figure 9 
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GnTl Cysl21Ala 



avipilviacdrstvrraldkl^ 
lgqvfrqiHpaavweddlevapd 
lepkwpkafwddwmnpeqrqgraciipeis 
apqlqvekvrtndrkelgevi^^ 



Gcggtgattcccatcctggtcatcgcctgtgaccgcagcactgttcggcgcgccctagaca^ 

agctcttecccatcatcgttagcc^ 

atccggcagcccgacctgagcagcattgcggtgcc^^ 

gctgggcgctgggccaggtcttcoggcagtttcgcttccccgcggccgtggtggtggagga 
tcgagtacmcgggccacotatccgctgctgaaggccgacccctccctgtg 
agatggtggacgccagcaggcctgagctgctct^^ 
tgagctggagcccaagtggwaaaggcctt^ 

gcc^tgagatctcaagaacgatgacctttggccgcaagggtgtgagccacgggcagtt 

aaccagoagWgtgcaottcacccagctggacctgtcttacctgcagcgggagg 

tgctccccagctgcaggtggagaaagtgaggacc^ 

cagcttcaaggcmcgccaaggctctggg^ 

ccagttcccgggccgccgtgtccac^ 



Figure 10 
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GnTl Argl20Ala, Cysl21H 



avipilviacdretvmhldkltoyrpsa^ 
algqvfiqfrfpaavvveddlevapdffey^^ 
elepkwpkafwddvrairrpeqrqgrac^ 
gapqlqvekvrtndrkelgevivqytgr^ 

Gcggtgattcccatcctggtcatcgcctgte^ 



atccggcagcccgacctgagcagoattgcggtgccgccggaccaccgcaagttccagggctactacaagatcgogcgcoacta^^ 
gctgggcgctgggccaggJcttccggcagTOcgcttccxx^ 



agatggtggacgccagcaggcctgagctgctctacxgcaccgactttttccctggcctgggctggctgctgttggc 



tgctccccagctgcaggtggagaaagtgaggaccaatgaccggaaggagctgggggaggtgcggg^gcagtatacgggcaggga 
cagctteaaggctttcgccaaggctetgggtgtctf^ 



Figure 1 1 
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Rat Liver ST3Gal III amino acid sequence: 

MGLL VFVRNT.TJ.A LCLFL VLGFLYYSA yT / KLHLLOWEDSNSLILSLDSAGQTLGTEYDRL 

GFLIXLDSKLPAELATKYA>nFSEGACKPGYASAMMTAIFPRFSKPAPMFLDDSFRKW 

ARmEFWPFGIKGQDNLIKAILSVTBCEYRLTPAlJ^SLHCRRCIIVGNGGVLANKSLGS 

PJDDYDIVIEI,NSAPVKGFEKJDVGSKTTLRITYPEGAMQRPEQYERDSLFVLAGFKW 

QDFKWLKYIVYKERVSASDGFWKSVATRVPKEPPEIRILNPYFIQEAAFTLIGLPFNN 

GmGRGNIPTLGSVAVTMALDGCDEVAVAGFGYDMNTPNAPLHYYETVRMAAIKE 

SWTHNIQREKEFLRKXVKARVITDLSSGI 



Figure 12 
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Full length UDP-N-acetylgalactosaminyltransferase 2 (GalNAcT2) nucleic acid and 

amino acid sequences 



Amino acid sequence 

Met Arg Arg Arg Ser Arg Met Leu Leu Cys Phe Ala Phe Leu Trp Val 
15 10 15 

Leu Gly He Ala Tyr Tyr Met Tyr. Ser Gly Gly Gly Ser Ala Leu Ala 
20 25 30 

Gly Gly Ala Gly Gly Gly Ala Gly Arg Lys Glu Asp Trp Asn Glu He 
35 40 45 

Asp Pro He Lys Lys Lys Asp Leu His His Ser Asn Gly Glu Glu Lys 
50 55 60 

Ala Gin Ser Met Glu Thr Leu Pro Pro Gly Lys Val Arg Trp Pro Asp 
65 70 75 80 

Phe Asn Gin Glu Ala Tyr Val Gly Gly Thr Met Val Arg Ser Gly Gin 
85 90 95 

Asp Pro Tyr Ala Arg Asn Lys Phe Asn Gin Val Glu Ser Asp Lys Leu 
100 105 110 

Arg Met Asp Arg Ala He Pro Asp Thr Arg His Asp Gin Cys Gin Arg 
115 .120 125 

Lys Gin Trp Arg Val Asp Leu Pro Ala Thr Ser Val Val He Thr Phe 
130 135 140 

His Asn Glu Ala Arg Ser Ala Leu Leu Arg Thr Val Val Ser Val Leu 
145 150 155 160 

Lys Lys Ser Pro Pro His Leu He Lys Glu He He Leu Val Asp Asp 
165 170 175 

Tyr Ser Asn Asp Pro Glu Asp Gly Ala Leu Leu Gly Lys He Glu Lys 
180 185 190 

Val Arg Val Leu Arg Asn Asp Arg Arg Glu Gly Leu Met Arg Ser Arg 
195 - 200 205 

Val Arg Gly Ala Asp Ala Ala Gin Ala Lys Val Leu Thr Phe Leu. Asp 
210 215 220 

Ser His Cys Glu Cys Asn Glu His Trp Leu Glu Pro Leu Leu Glu Arg 
225 230 235 240 

Val Ala Glu Asp Arg Thr Arg Val Val Ser Pro He He Asp Val He 
245 250 255 

Asn Met Asp Asn Phe Gin Tyr Val Gly Ala Ser Ala Asp Leu Lys Gly 
260 265 270 

Gly Phe Asp Trp Asn Leu Val Phe Lys Trp Asp Tyr Met Thr Pro Glu 
275 280 285 



FIG. 13A (1/2) 
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Gin Arg Arg Ser Arg Gin Gly Asn Pro Val Ala Pro lie Lys Thr Pro 
290 295 300 

Met He Ala Gly Gly Leu Phe Val Met Asp Lys Phe Tyr Phe Glu Glu 
305 310 315 320 

Leu Gly Lys Tyr Asp Met Met Met Asp Val Trp Gly Gly Glu Asn Leu 
325 330 335 

Glu He Ser Phe Arg Val Trp Gin Cys Gly Gly Ser Leu Glu He He 
340 345 350 

Pro Cys Ser Arg Val Gly His Val Phe Arg Lys Gin His Pro Tyr Thr 
355 360 365 

Phe Pro Gly Gly Ser Gly Thr Val Phe Ala Arg Asn Thr Arg Arg Ala 
370 375 380 

Ala Glu Val Trp Met Asp Glu Tyr Lys Asn Phe Tyr Tyr Ala Ala Val 
385 390 395 400 

Pro Ser Ala Arg Asn Val Pro Tyr Gly Asn He Gin Ser Arg Leu Glu 
405 410 415 

Leu Arg Lys Lys Leu Ser Cys Lys Pro Phe Lys Trp Tyr Leu Glu Asn 
420 425 430 

Val Tyr Pro Glu Leu Arg Val Pro Asp His Gin Asp lie Ala Phe Gly 
435 . ~ 440 445 

Ala Leu Gin Gin Gly Thr Asn Cys Leu Asp Thr Leu Gly His Phe Ala 
450 455 460 

Asp Gly Val Val Gly Val Tyr Glu Cys His Asn Ala Gly Gly Asn Gin 
465 470 475 480 

Glu Trp Ala Leu Thr Lys Glu Lys Ser Val Lys His Met Asp Leu Cys 
485 490 495 

Leu Thr Val Val Asp Arg Ala Pro Gly Ser Leu He Lys Leu Gin Gly 
500 505 510 

Cys Arg Glu Asn Asp Ser Arg Gin Lys Trp Glu Glh He Glu Gly Asn 
515 520 525 

Ser Lys Leu Arg His Val Gly Ser Asn Leu Cys Leu Asp Ser Arg Thr 
530 535 540 

Ala Lys Ser Gly Gly Leu Ser Val Glu Val Cys Gly Pro Ala Leu Ser 
545 550 555 560 

Gin Gin Trp Lys Phe Thr Leu Asn Leu Gin Gin 
565 570 



FIG. 13A(2/2) 
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Nucleic acid sequence 

atgcggcggc gctcgcggat gctgctctgc ttcgccttcc tgtgggtgct gggcatcgcc 60 

tactacatgt actcgggggg cggctctgcg ctggccgggg gcgcgggcgg cggcgccggc 120 

aggaaggagg actggaatga aattgacccc attaaaaaga aagaccttca tcacagcaat 180 

ggagaagaga aagcacaaag catggagacc ctccctccag ggaaagtacg gtggccagac 240 

tttaaccagg aagcttatgt tggagggacg atggtccgct ccgggcagga cccttacgcc 300 

cgcaacaagt tcaaccaggt ggagagtgat aagcttcgaa tggacagagc catccctgac 360 

acccggcatg accagtgtca gcggaagcag tggcgggtgg atctgccggc caccagcgtg 420 

gtgatcacgt ttcacaatga agccaggtcg gccctactca ggaccgtggt cagcgtgctt 480 

aagaaaagcc cgccccatct cataaaagaa atcatcttgg tggatgacta cagcaatgat 540 

cctgaggacg gggctctctt ggggaaaatt gagaaagtgc gagttcttag a.aatgatcga 600 

cgagaaggcc tcatgcgctc acgggttcgg ggggccgatg ctgcccaagc caaggtcctg 660 

accttcctgg acagtcactg cgagtgtaat gagcactggc tggagcccct cctggaaagg 720 

gtggcggagg acaggactcg ggttgtgfcca cccatcatcg atgtcattaa tatggacaac 780 

tttcagtatg tgggggcafcc tgctgacttg aagggcggtt ttgattggaa cttggtattc 840 

aagtgggatt acatgacgcc tgagcagaga aggtcccggc aggggaaccc agtcgcccct. 900 

ataaaaaccc ccatgattgc tggtgggctg tttgtgatgg ataagttcta ttttgaagaa 960 

ctggggaagt acgacatgat gatggatgtg tggggaggag agaacctaga gatctcgttc 1020 

cgcgtgtggc agtgtggtgg "cagcctggag atcatcccgt gcagccgtgt gggacacgtg 1080 

ttccggaagc agcaccccta cacgttcccg ggtggcagtg gcactgtctt tgcccgaaac 1140 

acccgccggg cagcagaggt ctggatggat gaatacaaaa atttctatta tgcagcagtg 1200 

ccttctgcta gaaacgttcc ttatggaaat attcagagca gattggagct taggaagaaa 1260 

ctcagctgca agcctttcaa atggtacctt gaaaatgtct atccagagtt aagggttcca 1320 

gaccatcagg atatagcttt tggggccttg cagcagggaa ctaactgcct cgacactttg 1380 

ggacactttg ctgatggtgt ggttggagtt tatgaatgtc acaatgctgg gggadaccag 1440 

gaatgggcct tgacgaagga gaagtcggtg aagcacatgg atttgtgcct tactgtggtg 1500 

gaccgggcac cgggctctct tataaagctg cagggctgcc gagaaaatga cagcagracag 1560 

aaatgggaac agatcgaggg caactccaag ctgaggcacg tgggcagcaa cctgtgcctg 1620 

gacagtcgoa cggccaagag cgggggccta agcgtggagg tgtgtggccc ggccctttcg 1680 

cagcagtgga agttcacgct caacctgcag cag 1713 

Figure 13B 
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A51 UDP-N-acetylgalactosaminyitransferase 2, GaINAcT2, nucleic acid and amino acid 

sequences 

Amino acid sequence 

Lys Lys Lys Asp Leu His His Ser Asn Gly Glu Glu Lys Ala Gin Ser 
1 5 10 15 

Met Glu Thr Leu Pro Pro Gly Lys Val Arg Trp Pro Asp Phe Asn Gin 
20 25 30 

Glu Ala Tyr Val Gly Gly Thr Met Val Arg Ser Gly Gin Asp Pro Tyr 
35 40 45 

Ala Arg Asn Lys Phe Asn Gin Val Glu Ser Asp Lys Leu Arg Met Asp 
50 55 60 

Arg Ala He Pro Asp Thr Arg His Asp Gin Cys Gin Arg Lys Gin Trp 
*5 70 .75 ™ 80 

Arg Val Asp Leu Pro Ala Thr Ser Val Val He Thr Phe His Asn Glu 
85 90 95 

. Ala Arg Ser Ala Leu Leu Arg Thr Val Val Ser Val Leu Lys Lys Ser 
100 105 110 

Pro Pro His Leu He Lys Glu He He Leu Val Asp Asp Tyr Ser Asn 
115 120 125 

Asp Pro Glu Asp Gly Ala Leu Leu Gly Lys He Glu Lys Val Arg Val 
130 135 140 

Leu Arg Asn Asp Arg Arg Glu Gly Leu Met Arg Ser Arg Val Arg Gly 
14 S 150 155 " 160 

Ala Asp Ala Ala Gin Ala Lys Val Leu Thr Phe Leu Asp Ser His Cys 
165 170 175 

Glu Cys Asn Glu His Trp Leu Glu Pro Leu Leu Glu Arg Val Ala Glu 
180 185 190 

Asp Arg Thr Arg Val Val Ser Pro He He Asp Val He Asn Met Asp 
155 200 205 

Asn Phe Gin Tyr Val Gly Ala Ser Ala Asp . Leu Lys Gly Gly Phe Asp 
210 215 220 

Trp Asn Leu Val Phe Lys Trp Asp Tyr Met Thr Pro Glu Gin Arg Arg 
225 230 235 240 

Ser Arg Gin Gly Asn Pro Val Ala Pro He Lys Thr Pro Met He Ala 
245 250 255 

Gly Gly Leu Phe Val Met Asp Lys Phe Tyr Phe Glu Glu Leu Gly Lys 
260 265 270 

Tyr Asp Met Met Met Asp Val Trp Gly Gly Glu Asn Leu Glu He Ser 
275 280 285 

Phe Arg Val Trp Gin Cys Gly Gly Ser Leu Glu He He Pro Cys Ser 



FIG. 14A(l/2) 
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290 295 300 

Arg Val Gly His Val Phe Arg Lys Gin His Pro Tyr Thr Phe Pro Gly 
305 310 315 320 

Gly Ser Gly Thr Val Phe Ala Arg Asn Thr Arg Arg Ala Ala Glu Val 
325 330 335 

Trp Met Asp Glu Tyr Lys Asn Phe Tyr Tyr Ala Ala Val Pro Ser Ala 
340 345 350 

Arg Asn Val Pro Tyr Gly Asn He Gin Ser Arg Leu Glu Leu Arg Lys 
355 360 365 

Lys Leu Ser Cys Lys Pro Phe Lys Trp Tyr Leu Glu Asn Val Tyr Pro 
370 375 380 

Glu Leu Arg Val Pro Asp His Gin Asp He Ala Phe Gly Ala Leu Gin 
385 390 395 400 

Gin Gly Thr Asn Cys Leu Asp. Thr Leu Gly His Phe Ala Asp Gly Val 
405 410 415 

Val Gly Val Tyr Glu Cys His Asn Ala Gly Gly Asn Gin Glu Trp Ala 
420 425 430 

Leu Thr Lys Glu Lys Ser Val Lys His Met Asp Leu Cys Leu Thr Val 
435 440 445 

Val Asp Arg Ala Pro Gly Ser Leu lie Lys Leu Gin Gly Cys Arg Glu 
450 455 460 

Asn Asp Ser Arg Gin Lys Trp Glu Gin lie Glu Gly Asn Ser Lys Leu 
465 470 475 480 

Arg His Val Gly Ser Asn Leu Cys Leu Asp Ser Arg Thr Ala Lys Ser 
485 490 495 

Gly Gly Leu Ser Val Glu Val Cys Gly Pro Ala Leu Ser Gin Gin Trp 
500 505 510 

Lys Phe Thr Leu Asn Leu Gin Gin 
515 520 



FIG. 14A(2/2) 
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Nucleic acid sequence 



aaaaagaaag accttcatca 


cagcaatgga 


gaagagaaag 


cacaaagcat 


ggagaccctc 


60 


cctccaggga aagtacggtg 


gccagacttt 


aaccaggaag 


cttatgttgg 


agggacgatg 


120 


atccactcca oocacrcraccc 


ttacgcccgc 


aacaagttca 


accaggtgga 


gagtgataag 


180 


cttcaaataa acacracrccafc 


ccctgacacc 


cggcatgacc 


agtgtcagcg 


gaagcagtgg 


240 


ccrcfcrtaaatc tcrcccraccac 


cacrccrtcrcrtcf 


atcacgtttc 


acaatgaagc 


caggtcggcc 


300 


CLciL Lwayya wuy uyy wciy 


ecrtcfcttaaci 


aaaacrccccrc 


cccatctcat 


aaaagaaatc 


360 




vftftvgClvvv w 


cracf era coorcrcr 


ctctCwtggg 


gaaaattgag 


420 


ctciGiy u»y uy cty uuyu t-ciy ddc*. 


v»y« uvy »wy w. 


oaaacrcctca 


tgcgctcacg 


gg 1 1 cggggg 


480 


gccgacgctg cccaayccaa 


yy v»w w wya^^ 


fc t e c toaac a 

w w w w »55*»** 


atcactcfccfa 


gtgtaatgag 


540 


cact-ggcxgg aycccctcct 


y y aacigy y ty 


crecr cr acre* ac a 


33 RVWV 333 w 


tgtgtcaccc 


600 


auCatCyauy tCottaotot 


yyciv^ciciv> l* w 


c acr t a t cr t cr cr 


acracat c tcf c 


tgacttgaag 


660 


yyGyy wtbuy at *»9y daw u t- 




taaaattaca 


tgacgcctga 


gcagagaagg 


720 


ccccggcagg ygaacccayt 


/*» nr* f c+ e* "fr" a a 

V^WWUL Lut CI 


aaaaccccca 


tcrattcfCtcrcf 


tgaactgttt 


780 


y tyaLyyaUa. ay LUL-L-aUCU 




aacraacfwaccf 


acatgatgat 


ggatgtgtgg 


840 




ctcattccoc 


atataacacft 


gtggtggcag 


cctggagatc 


900 


dHwU-i^y uy wet yuuywywyyy 


acaccrfccrfctc 


cggaagcagc 


acccctacac 


gttcccgggt 


960 




ccgaaacacc 


cgccgggcag 


cagaggtctg 


gatggatgaa 


1020 


tacaaaaatt tctattatcrc 


agcagtgccfc 


tctgctagaa 


acgttcctta 


tggaaatatt 


1080 


cagagcagat tggagcttag 


gaagaaactc 


agctgcaagc 


ctttcaaatg 


gtaccttgaa 


1140 


aatgtctatc cagagttaag 


ggttccagac 


catcaggata 


tagcttttgg 


ggccttgcag 


1200 


cagggaacxa actgccwcga 


cactuugyga 


woiw* *-» v v»y w *->y 


ataatcrtacft 


tcfcraatttat 


1260 


gaatgtcaca atgctggggg 


aaaccaggaa 


tgggccttga 


cgaaggagaa 


gtcggtgaag 


1320 


cacatggatt tgtgccttac 


tgtggtggac 


cgggcaccgg 


gctctcttat 


aaagctgcag 


1380 


ggctgccgag aaaatgacag 


cagacagaaa 


tgggaacaga 


tcgagggcaa 


ctccaagctg 


1440 


aggcacgtgg gcagcaacct 


gtgcctggac 


agtcgcacgg 


ccaagagcgg 


gggcctaagc 


1500 


gtggaggtgt gtggcccggc 


cctttcgcag 


cagtggaagt 


tcacgctcaa 


cctgcagcag 


1560 



Figure 14B 



WO 2005/089102 PCT/US2005/003856 




WO 2005/089102 



PCT/US2005/003856 




WO 2005/089102 



PCT/US2005/003856 




WO 2005/089102 



PCT/US2005/003856 




WO 2005/089102 



PCT/US2005/003856 




WO 2005/089102 



PCT/US2005/003856 




FIG. 19 
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Figure 24 



a) MBP-pST3Gall fusion protein 

MKIEEGKLVIWINGDKGYNGLAEVQKKFEKDTGIKVTVEHPDKLEEKFPQVA.\TGDGPDIIFW^DKF 

gctaSgl^ 

^^toysiaeaWgetamtingpwaws 

S^kENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPPJAATMENAQKGEI^ 
WYA^AVINAASGRQTVDEALKDAQTNSSSN^^ 

PCTCTRCIEEQRVSAWFDERFNRSMQPLLTAKNAHLEEDTYKWLPXQREKQP^ 
GNVDPLLE^^ 

GRWSTGIL^ 
KIRIFKGR 

b) MBPSBD-pST3Gall fusion protein 

MKIEEGKLWINGDKGYNGLAEVGKl^EKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHpRF 

g^q^glYae^ 

KELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDl^^ 
S^Y^AA^ 

NKELAKLEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEI^ 
WYAWTAVINAASGRQTVDEALKDAQTNSSSNNNNM^^ 

SGSVTSTSKTTATASKTSTSTSSTSCTTPTAVAVTFDLTATTTYGENlYLVGSISQLGDWETSDGIALSAD 

KYTSSDPLWYVTVTLPAGESFEYKFIRIESDDSVEWESDPNREYTWQACGTSTATVTDTW 

FKKLMKYPYRPCTCTRCIEEQRVSAWFDERFNRSMQPLLTAKNAHLEEDTYKWLRLQREKQPNNLN 

DTIRELFOVWGNVDPLLEKRLVSCRRCAWGNSGNLKESYYGPQroSHDFVLRMNKAPTEGFEADVG 

SKTTHHFVYPESFRELAQEVSM1LWFKTTDLEWVISATTTGMSHTYW 

YVFDRWLQGHGRYPSTGILSVIFSLHICDEVDLYGFGADSKGNWHHYWENNPSAGAFRKTGVHDGDF 
ESNVTTILASINKIRIFKGR 
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Figure 25 
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Figure 26 



A. 

MKIEEGKLVIWINGDKGYN 

GGYAQSGLLAEITPDKAFQDKLYPFTWDAWYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALD 
KELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDN^ 

MNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASP 

NKELAKEFLENYLLTDEGLEAWKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQM 

WYAVRTAVINAASGRQTVDEALKDAQTN^ 

STRKTPMATGAVPAKKKWQATKSPASSPHPTTRRRQRLKASEFKSEPRWDFEEEYSLDMSSLQT 

NCSASVKEKASKSPWLQNIFLPNITLFLDSGRFTQSEWNRLEHFAPPFGFMELNQSLVQKWTI^ 

PVRQQQLLLASLPTGYSKCITCAVVGNGGILNDSRVGREIDSHDYVFRLSGAVIKGYEQDVGTRT 

SFYGFTAFSLTQSILILGRRGFQHVPLGKDVRYLHFLEGTRNYEWLEAMFLNQTLAKTHLSWFR 

HRPQEAFRNALDLDRYLLLHPDFLRYMKNRFLRSKTLDTAXWMYRPTTGALLLLTAL 

SAYGFITEGIIERFSDHYYDTSWKRLIFYINHDFRLERMVWKRLHDEGnW 



B. 

MKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIff 

GGYAQSGLLAEITPDKAEQDKLYPFTWDAVRYNG 

KELKAKGKSALMFNLQEPYFTWPLIAADG 

MNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFV 

NKELAK^FLENYLLTDEGLEAWKDKPLGAVALKSYEEELAKDPWAATMENAQKGEIMPNIPQMS^ 

WYAVRTAVINAASGRQTVDEALKDAQTN^^ 

MKERSLQSLAKPKSQAPTRARRTTIYAEPVPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDK 
VPHTAQRAAWKSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTA 
SRTVSEKHQGKAATTAKTLIPKSQHRM^ 

SPTTQRNQRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDS 
RHFNQSEWDRLEHFAPPFGFMELNYSLVQKWTRFPPVPQQQLLLASLPAGSLRCITCAWGNG 
GILNNSHMGQEmSHDYWRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVPLGK 
DVRYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHM 

KNRFLRSKTLDGAHWMYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYBTSWKRLIFY 
mHDFKLEREVWKRLHDEGIIRLYQRPGPGTAKAKN 
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Figure 27 
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Figure 30 
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Figure 37 
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Figure 38 

A. Human ST6GalNAcI 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQS 

LAKTKSQAPTRAI^TTr^AEPVPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDK 

VPHTAQRAAWKSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNG 

GQTRKLTASRTVSEKHQGKAATTAKTLIPKSQHRMLAPTGAVSTRTRQKGVTTAVIP 

PKEKKPQATPPPAPFQSPTTQRNQRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSV 

KIKASKSLWLQKLFLPNETLFLDSREIFNQSEWDRLEHFAPPFGFMELNYSLVQKVVT 

RFPPVPQQQLLLASLPAGSLRCITCAWGNGGILNNSHMGQEIDSHDYVFRLSGALIK 

GYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVPLGKDVRYLHFLEGTRDYEWLE 

ALLMNQTVMSKMLFWFRHRPQEAFREALHMDRYLLLHP 

GAHW^IYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSWKRLIFYINH 
DFKLEREVWKRLHDEGERLYQRPGPGTAKAKN 

B. Chicken ST6GalNAcI 

MGFLIRRLPKDSRIFRWLLILTVFSFnTSFSALFGMEKSIFRQLKr^QSIAHMLQVDTQ 

DQQGSNYSANGRISKVGLERDIAWLELNTAVSTPSGEGKEEQKKTVKPVAKVEEAK 

EKVTVKPFPEVMGITNTTASTASVVERTKEKTTARPVPGVGEADGKRTTIALPSMKE 

DKEKATVKPSFGMKVAHANSTSKDKPKAEEPPASVKA1RPVTQAATVTEKKKLRAA 

DFKTEPQWDFDDEYILDSSSPVSTCSESVRAKAAKSDWLRDLFLPNITLFIDKSYFNV 

SEWDRLEHFAPPYGFMELNYSLVEEVMSRLPPNPHQQLLLANSSSNVSTCISCAVVG 

NGGILNNSGMGQEIDSHDYVFRVSGAVIKGYEKDVGTKTSFYGFTAYSLVSSLQNLG 

HKGFiaCIPQGKHIRYIHFLEAVRDYEWLKALLLDKDIRXGFLNY\"GRRPRERFDEDF 

TMNKYLVAHPDFLRYLKNE^LKSKNLQKRYWRLYRPTTGALLLLTALHLCDRVSAY 

GYITEGHQKYSDHYYDKEWKREWYVNHDFNLEKQVWKRLHDENIMKLYQRS 

C. Mouse ST6GalNAcI protein beginning at residue 32 of the native mouse protein 
DPRAKI)SRCQFIWKM)ASAQENQQKAEPQVPIMTLSPRVHNKESTSVSSKDLKKQER 
EAVQGEQAEGKEKRKLETIRPAPENPQSKAEPAAKTPVSEHLDKLPRTPGALSTRKTP 
M ATGAVP AKECKV VQ ATKSP AS SPHPTTRRRQRLKASEFKSEPRWDFEEEYSLDMS SL 
QTNCSASVKIKASKSPwXQNIFLPNITLFLDSGRFTQSEWmLEHFAPPFGFMELNQSL 
VQKVVTRFPPVRQQQLLLASLPTGYSKCITCAVVGNGGILNDSRVGREIDSHDYVFR 
LSGAVIKGYEQDVGTRTSFYGFTAFSLTQSILILGRRGFQHVPLGKDVRYLHFLEGTR 
NYEWLEAMFLNQTLAKTHLSWFRHRPQEAFRNALDLDRYLLLHPDFLRYMKNRFL 
RSKTLDTAHWRIYRPTTGALLLLTALHLCDKVSAYGFITEGHQRFSDHYYDTSWKRL 
IFYINHDFRLERMVWKRLHDEGIIWLYQRPQSDKAKN 
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Figure 41 

MASKSWLNFLTFLCGSAIGFLLCSQLFSILLGEKVDTQPNVLHNDPHARHSDDNGQN 

HLEGQMNFNADSSQHKDENTDIAENLYQKVRILCWVMTGPQNLEKKAKHVKATW 

AQRCNKVLFMSSEENKDFPAVGLKTKEGRDQLYWKTIKAFQYVHEHYLEDADWFL 

KADDDTYVILDNLRWXLSKYDPEEPIYFGRRFKPYVKQGYMSGGAGYVLSKEALKR 

F VD AFKTDKCTHS S SIEDLALGRCMEIMNVE AGDSRDTIGKETFHDPF WEHHLIKGYL 

PRTFWYWNYNYYPPVEGPGCCSDLAVSFHYVDSTTMYELEYLVYHLRPYGYLYRY 

QPTLPERILKEIS Q ANKNEDTKVKLGNP 
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Figure 43 

A. Yersinia MBP 

MKI EEGKLVI WINGDKGYNG LAEVGKKFEK DTGIKVTIEH PDKLEEKFPQ 
VAATGDGPD I I FWAHDRFGG YAQSGLLAEL TPSKAFQEKL FPFTWDAVRF 
NGKL I GYPVA VEAL S L I YNK DLVKEAPKTW EE I PALDKTL RANGKSAIMW 
NLQEPYFTWP VIAADGGYAF KFENGVYDAK NVGVNNAGAQ AGLQFIVDLV 
KNKHINADTD YS I AEAAFNK GETAMTINGP WAWSNIDKSK INYGVTLLPT 
FHGQPSKP F V GVLTAG I NAA S PNKE LATE F LENYL I TDQG LAEVNKDKPL 
GAVALKS FQE QLAKD PR I AA TMDNATNGE I MPNI PQMAAF WYATRSAVLN 
AITGRQTVEA ALNDAATRIT K 

B. E. coli MBP 

MKI EEGKLVI WINGDKGYNG LAEVGKKFEK DTGIKVTVEH PDKLEEKFPQ 
VAATGDGPD I I FWAHDRFGG YAQSGLLAEI TPDKAFQDKL YPFTWDAVRY 
NGKLIAYPIA VEALS L I YNK DLLPNPPKTW EEIPALDKEL KAKGKSALMF 
NLQEPYFTWP LIAADGGYAF KYENGKYD I K DVGVDNAGAK AGLTFLVDLI 
KNKHMNADTD YS I AEAAFNK GETAMTINGP WAWSNIDTSK VNYGVTVLPT 
FKGQPSKPFV GVLSAGINAA SPNKELAKEF LENYLLTDEG LEAVNKDKPL 
GAVALKS YEE ELAKDPRIAA TMENAQ KGE I MPNI PQMSAF WYAVRTAVIN 
AASGRQTVDE ALKDAQTNS 

C. Pyrococcus furiosus MBP 

MKIEEGKWI WHAMQPNELE VFQSLAEEYM ALSPEVEIVF EQKPNLEDAL 

KAAI PTGQGP DLF I WAHDW I GKFAEAGLLE PIDEYVTEDL LNEFAPMAQD 

AMQYKGHYYA LPFAAETVAI I YNKEMVS E P PKTFDEMKAI MEKYYDPANE 

KYGIAWPINA YFISAIAQAF GGYYFDDKTE QPGLDKPETI EGFKFFFTEI 

WPYMAPTGDY NTQQSIFLEG RAPMMVNGPW SINDVKKAGI NFGWPLPPI 

I KDGKE YWPR PYGGVKLIYF AAGIKNKDAA WKFAKWLTTS EE S I KTLALE 

LGYI PVLTKV LDDPEIKNDP VIYGFGQAVQ HAYLMPKSPK MSAVWGGVDG 

AINEILQDPQ NADIEGILKK YQQEILNNMQ G 

D. Thermococcus litoralis MBP 

MKIEEGKIVF AVGGAPNE I E YWKGVIAEFE KKYPGVTVEL KRQATDTEQR 
RLDLVNALRG KSSDPDVFLM DVAWLGQF I A SGWLEPLDDY VQKDNYDLSV 
FFQSVINLAD KQGGKLYALP VYIDAGLLYY RKDLLEKYGY SKPPETWQEL 
VEMAQKIQSG ERETNPNFWG FVWQGKQYEG LVCDFVEYVY SNGGSLGEFK 
DGKWVPTLNK PENVEALQFM VDLIHKYKIS P PNT YTEMTE EPVRLMFQQG 
NAAFERNWPY AWGLHNADDS PVKGKVGVAP LPHFPGHKSA ATLGGWHIGI 
SKYSDNKALA WE FVKFVE S Y SVQKGFAMNL GWNPGRVDVY DDPAWSKSP 
HLKELRAVFE NAVPRPIVPY YPQLSEIIQK YVNSALAGKI SPQEALDKAQ 
KEAEELVKQY SK 

E. Thermatoga maritime MBP 

MKIEQTKLTI WSSEKQVDIL QKLGEEFKAK YGIPVEVQYV DFGSIKSKFL 

TAAPQGQGAD IIVGAHDWVG ELAVNGLIEP IPNFSDLKNF YDTALKAFSY 

GGKLYGVPYA ME AVAL I YNK DYVDSVPKTM DELIEKAKQI DEEYGGEVRG 

FIYDVANFYF SAPFILGYGG YVFKETPQGL DVTDIGLANE GAVKGAKLIK 

RMIDEGVLTP GDNYGTMDSM FKEGLAAMI I NGLWAIKSYK DAGINYGVAP 

IPELEPGVPA KPFVGVQGFM INAKSPNKVI AMEFLTNFIA RKETMYKIYL 
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ADPRLPARKD VLELVKDNPD WAPTQSASM GTPMPNVPEM APVWSAMGDA 
LSIIINGQAS VEDALKEAVD KIKAQIEK 

F. Vibrio cholerae MBP 

MKIEEGQLTI WINGDKGYNG LAEVGKKFEA DTGI KVTVAH, PDALQDKFPQ 

TAATGDGPD I VFWAHDRFGG YAEAGLLVE I KPSAKIQEGI VDFAWDAVKY 

NGKI I GYP I A VESLSLIYNK DLVPNPPKSW EEVAELDAKL KKEGKSAIMW 

NLKEPYFTWP LMAADGGYAF KYGVDGYDVK DAG I NNKGVK DAMNFVKGLV 

DKGVISPDMD YSVSESAFNQ GNTAMTINGP WSWGNIEKSG INYGVTTLPK 

FNGQASKPFV GVLTAGI STA SPNKDLAVEF I ENYLLTNDG LRMVNNDKPL 

GAVALNS FQR ELDADARIAA TMDNAMNGE I MPNI PQMNAF WSSAKNAIIN 
IVDGRQTVDA ALADAEKQMT KP 
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Figure 44 
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